Plausible Self-Organizing Maps for Speech Recognition
نویسنده
چکیده
A major problem in connectionist phonetic acoustic decoding is the way to present acoustic signal to the network. Neurobiological data about the inner ear and the primary auditory cortex can be very helpful but are rare. On the other hand, other biological works have shown that structure and functioning of the visual cortex, which have been extensively studied, are very close to the structure and functioning of the auditory cortex. It has been shown that a simple two-layered network of linear neurons can organize itself to extract the complete information contained in a set of presented patterns. This model has been applied to visual information and has revealed orientation and spatial frequency selective cells. Such principles have been applied to speech recognition. So we have designed two kinds of maps. The first kind is able to represent frequency characteristics of the signal (e.g.: formantic structure). The second kind takes into account dynamic aspects of the signal (e.g.: formantic transition). First, these results have been analyzed both from a phonetic and a signal processing point of view and show very interesting representation of the signal. Second, these representations can be used as the input map of a dynamic connectionist network, for speech recognition. The input maps have a selective activity with regard to phonemic structures, and enable dynamic networks to differentiate the phonemes.
منابع مشابه
Self-organizing Maps for Speech Recognition
The spoken speech is the easiest and most natural way for the communication between human beings. So, the human-machine communication can be executed based on the way that human-human communication occurs. Researches in automatic speech recognition (ASR) have been developed for decades to produce communication as natural as possible. There some few attempts to use Self-organizing Maps to solve ...
متن کاملArchitecture Optimization Model for the Probabilistic Self-organizing Maps
The PRobabilistic Self-Organizing Maps (PRSOM) become more and more interesting in many fields such as: pattern recognition, clustering, classification, speech recognition, data compression, medical diagnosis, etc. The PRSOM give an estimation of the density probability function of the data, which depends on the parameters of the PRSOM, such as the architecture of the network. When we take a ra...
متن کاملNovel Approach for Speech Recognition by Using Self – Organized Maps
The method of self-organizing maps (SOM) is a method of exploratory data analysis used for clustering and projecting multi-dimensional data into a lower-dimensional space to reveal hidden structure of the data. The Self-Organizing Feature Maps (SOFMs) [11] is a class of neural networks capable of recognizing the main features of the data they are trained on. There is extensive literature on its...
متن کاملSteel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps
Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...
متن کاملImproved Self-Organizing Maps and Speech Compression
Recent developments of Self-Organizing Maps or Kohonen networks become more and more interesting in many fields such as: pattern recognition, clustering, speech recognition, data compression, medical diagnosis... Kohonen networks is unsupervised learning models. The results obtained by the Kohonen networks are dependent on their parameters such as the architecture of the Kohonen map, the later ...
متن کاملWarpNet: Self-Organizing Time Warping
Motorola, Phoenix Corporate Research Laboratories, 2100 East Elliot Road, MD EL508, Tempe, AZ 85284, USA tel: (602)413-4129, fax: (602)413-7281, email: [email protected] Abstract We describe “WarpNet”, a time-warping algorithm for speech recognition based on neural nets. WarpNet is a self-organizing matching mechanism intended to replace dynamic programming (Dynamic Time Warping or Viterbi-a...
متن کامل